Researchers should make data freely accessible.

Author

  • Robert D Herbert
Abstract

Editorial

If you have ever had the pleasure of leafing through old journals you would appreciate that the journals of yesteryear were quite different. A typical article from a scientific journal of the early 20th century was much longer than those of today. Many articles in old journals contained lots of tables but few graphs. (The graph is a remarkably recent innovation.) And, surprisingly to the modern reader, it was not uncommon for researchers to publish a complete record of all of the data from a particular experiment. That is, researchers often reported data from each subject.

These days journal articles rarely report data from individual subjects. Instead, it is more usual to see summary statistics reported. For example, researchers may report the mean and standard deviation of a distribution. Descriptive statistics such as means and standard deviations, when used appropriately, provide a concise summary which substitutes for a tedious enunciation of each datum.

The convention of reporting summary statistics, rather than 'raw' data, is a pragmatic one. Modern readers are faced with unmanageably large amounts of research data so they prefer to read concise research reports. Moreover, contemporary clinical studies are very large. Obviously it would be impossible to provide, in hard copy, data for each of the 38 050 participants in the observational study of low back pain reported by Smith and colleagues in this journal (Smith et al 2006). The sensible shortcut is to report only summaries of data.

There are, however, reasons why some readers might want access to raw data. Access to raw data makes it possible to:

1. Scrutinise data. By inspecting raw data readers can ascertain how complete the data set is and identify anomalies in the data such as outliers. This provides an indication of data quality that may not be apparent in summary statistics.

2. Re-analyse data. Some published statistical analyses (perhaps most often the simplest analyses) are performed incorrectly. Even when the analysis is conducted correctly, it may be suboptimal. When raw data are available it is possible to check the accuracy of an analysis or to subject the data to better analyses.

3. Incorporate data in meta-analyses. Ideally most quantitative research data would eventually be incorporated in a meta-analysis. But meta-analysis is often thwarted by incomplete reporting of data. This problem could be circumvented if meta-analysts routinely had access to raw data.

Access to raw data also opens …

Similar resources

Junior scientists are sceptical of sceptics of open access: a reply to Agrawal.

Anurag A. Agrawal [1] recently published a letter in TIPS in which he suggested four points that researchers should consider when choosing to publish open access (OA). Although a critical evaluation of the pros and cons of publishing OA is warranted and important, three other points should also be considered when discussing OA. First, it is important not to confuse OA with OA publishing. To th...

Review: Practical Design and Analysis of 2-Colour cDNA Microarray Experiments

This review paper is aimed at biological researchers who are interested in or have begun to use cDNA microarrays for their investigations. Large microarray studies typically involve a multidisciplinary team with various groups performing different aspects of the same experiment. This approach means that microarrays are less accessible to new researchers than more traditional biological techniq...

Supporting Science through the Interoperability of Data and Articles

Whereas it is established practice to publish relevant findings of a research project in a scientific article, there are no standards yet as to whether and how to make the underlying research data publicly accessible. According to the recent PARSE.Insight study of the EU, over 84% of scientists think it is useful to link underlying digital research data to peer-reviewed literature.[1] This tren...

Bayesian Stochastic Frontier Analysis Using WinBUGS

Markov chain Monte Carlo (MCMC) methods have become a ubiquitous tool in Bayesian analysis. This paper implements MCMC methods for Bayesian analysis of stochastic frontier models using the WinBUGS package, a freely available software. General code for cross-sectional and panel data are presented and various ways of summarizing posterior inference are discussed. Several examples illustrate that ...

Swedish National Data Service's Strategy for Sharing and Mediating Data

resources for researchers to document and make their data accessible for others as the most important obstacle. Concerning interventions to enhance reuse of digital data, the majority of the doctoral students and the professors thought it would be effective to get more information about accessible research data in data archives or databases. Nearly 100% in both groups reported that more trai...

"In vivo" spam filtering: A challenge problem for data mining

Spam, also known as Unsolicited Commercial Email (UCE), is the bane of email communication. Many data mining researchers have addressed the problem of detecting spam, generally by treating it as a static text classification problem. True in vivo spam filtering has characteristics that make it a rich and challenging domain for data mining. Indeed, real-world datasets with these characteristics a...

Journal title:
  • The Australian journal of physiotherapy

Volume 54, Issue 1

Pages: -

Publication date: 2008